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ABSTRACT 

Specific promoter recognition by bacterial RNA 
polymerase is mediated by <r subunits, which 
assemble with RNA polymerase core enzyme (E) 
during transcription initiation. However, a 70 (the 
housekeeping a subunit) and <r s (an alternative a 
subunit mostly active during slow growth) recognize 
almost identical promoter sequences, thus raising 
the question of how promoter selectivity is 
achieved in the bacterial cell. To identify novel 
sequence determinants for selective promoter rec- 
ognition, we performed run-off/microarray (ROMA) 
experiments with RNA polymerase saturated either 
with o 70 (Ea 70 ) or with o s (Eo s ) using the whole 
Escherichia coli genome as DNA template. We 
found that Ea 70 , in the absence of any additional 
transcription factor, preferentially transcribes 
genes associated with fast growth (e.g. ribosomal 
operons). In contrast, E<r s efficiently transcribes 
genes involved in stress responses, secondary me- 
tabolism as well as RNAs from intergenic regions 
with yet-unknown function. Promoter sequence 
comparison suggests that, in addition to different 
conservation of the -35 sequence and of the UP 
element, selective promoter recognition by either 
form of RNA polymerase can be affected by the A/ 
T content in the -10/+1 region. Indeed, site-directed 
mutagenesis experiments confirmed that an A/T 
bias in the -10/+1 region could improve promoter 
recognition by Eo s . 



INTRODUCTION 

Bacteria must cope with drastic changes in their environ- 
ment, such as nutritional up- and downshifts, and vari- 
ations in pH, osmolarity and temperature. Bacterial cells 
can quickly adapt to such environmental changes by 
modulating gene expression, at both transcriptional and 
post-transcriptional levels. At the transcription initiation 
level, gene expression can be regulated either through ac- 
cessory transcription factors (activators and repressors), 
or via assembly of different forms of RNA polymerase. 
The latter mechanism of gene regulation involves the 
assembly of RNA polymerase core enzyme (indicated as 
E) with one of several ct factors that can direct RNA poly- 
merase to specific promoter sequences (1). Typically, in 
the bacterial cell, one cr factor is devoted to transcription 
of a large part of the genome, including the essential 
cellular functions (housekeeping ct factor), while the 
so-called 'alternative ct factors' direct transcription of 
smaller sets of genes, often linked to specific functions 
(e.g. response to cellular stresses). 

In Escherichia coli, seven ct factors have been identified: 
cr 70 or ct d (the housekeeping ct) and six alternative a 
factors: ct e , a F , a H , ct 1 , ct and ct s (2). Most alternative 
ct factors recognize promoter sequences that strongly 
diverge from the consensus sequence for ct 70 ; in contrast, 
genes under the control of ct s are characterized by 
promoter sequences very similar to CT 70 -dependent genes 
(3,4). In line with this observation, in vitro selection of 
DNA sequences bound with high affinity by RNA poly- 
merase associated with ct s (Ect s ) led to the identification of 
a consensus sequence very similar to the one recognized by 
ct 70 (5). Some level of overlapping in promoter recognition 
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by a 70 and a s might be consistent with a s function: 
indeed, in conditions leading to slow metabolic activity, 
such as nutrient starvation or oxidative stress, a s might 
take over in transcription of genes important for cell 
survival that are under a 70 control during faster growth 
(6). However, in order to switch from the fully active to 
the slow metabolic state, specific gene expression, and thus 
specific recognition of a 70 - versus a -dependent pro- 
moters, must take place in the bacterial cell. Some 
promoter sequence determinants can favour recognition 
by either a or a s (3): for instance, a C nucleotide 
upstream of the —10 promoter element (— 13C) enhances 
transcription by Ea (7). However, some sequence 
features favouring promoter recognition by a seem to 
be dependent on specific promoter contexts: for instance, 
at the Ecr s -dependent aidB promoter, Ea s , but not Ea 70 , 
can recognize with equal efficiency either C or T as the 
first nucleotide in the —10 promoter element (8). However, 
the percentage of a s -dependent promoters carrying 
a — 12C element is not significantly higher than in 
<r 70 -dependent promoters (9), suggesting that the 
presence of a — 12C might only contribute to specific 
promoter recognition by a s at selected promoters. 

In addition to sequence determinants, it has been 
proposed that transcription factors such as CRP, IHF 
and Lrp can selectively block (or promote) promoter rec- 
ognition by either Ea 0 or Ea s (10). A transcription regu- 
lator important for the modulation of promoter 
accessibility to different RNA polymerase holoenzymes 
is the H-NS protein, which can repress transcription by 
Ea 70 , but not by Ea s , at various promoters (11), a phe- 
nomenon known as transcriptional silencing (12). Specific 
promoter recognition by Ea is also affected by the degree 
of DNA supercoiling (13). In addition, a s activity and 
intracellular concentrations are affected by various 
factors, such as the presence of an anti-sigma factor for 
a 70 (14), and by the accumulation of the signal molecules 
ppGpp (15) and polyphosphate (3). 

Work aimed to the identification of a s -specific 
promoter elements has mostly been carried out in vivo, 
comparing relative gene expression in a wild type versus 
an rpoS mutant derivative unable to produce the a s 
protein (9,16-20). Although this approach has proven 
very useful for the identification of /-/joS-dependent 
genes and in the characterization of a s -specific promoter 
elements such as the — 13C, it cannot distinguish between 
promoters directly recognized by the Ea s form of RNA 
polymerase and promoters under the indirect control of 
the rpoS gene. In contrast, dependence on Ea s , as 
determined by biochemical experiments with purified 
RNA polymerase, has only been determined for a 
limited number of promoters [e.g. fic (21), csiD (22) and 
aidB (23)]. In this work, we have performed in vitro tran- 
scription experiments with either Ea 70 or Ea s , using the 
whole E. coli genome as template, to identify promoter 
regions selectively recognized by two forms of RNA poly- 
merase. Our results support previous observations that 
Ea 70 - and Ea s -dependent promoters differ in conserva- 
tion of the UP element and of the —35 sequence, and in 
the sequence immediately upstream of the —10 promoter 
element. In addition, we show that differences in the A/T 



content in the — 10/+1 promoter region can favour tran- 
scription by either form of RNA polymerase. Finally, our 
work has led to the identification of novel a s -dependent 
genes, thus providing further insight on the physiological 
role of a s . 

MATERIALS AND METHODS 

Protein purification and RNA polymerase reconstitution 

Escherichia coli RNA polymerase core enzyme was pur- 
chased from Epicentre (Madison, WI, USA); histidine- 
tagged a factors were produced and purified as described 
(8,24). The a s protein appeared totally pure from contam- 
inants as determined by denaturing protein gel electro- 
phoresis; in contrast, in the a 0 preparations the 
presence of faint additional bands, corresponding to the 
molecular weight of the core RNA polymerase subunits a, 
p and p', could be detected (data not shown). Weak con- 
tamination of a 70 preparations by core RNA polymerase 
subunits is consistent with the high affinity of a 7 for the 
core enzyme. For reconstitution of RNA polymerase 
holoenzymes, the core enzyme was incubated for lOmin 
at 37°C with either a s or a 70 at a 1:10 ratio. For calcula- 
tion of RNA polymerase concentrations in transcription 
assays, it was assumed that, after reconstitution, core 
enzyme would be 100% active and fully saturated by 
either a factor. 

In vitro transcription on supercoiled plasmids 

Promoter regions of interest were amplified from the gen- 
ome of E. coli MG1655 (25) and cloned into the pJCDOl 
plasmid (22) using the BamHI and EcoRI sites, with the 
exception of the ilvY promoter, which was cloned using 
the BamHI and SphI sites, due to the presence of an 
EcoRI site in the ilvY promoter region. Single-round 
in vitro transcription experiments were carried out on 
supercoiled templates (3nM) in the presence of lOnM 
reconstituted RNA polymerase holoenzyme. Plasmid 
DNA and reconstituted holoenzyme were incubated for 
lOmin at 37°C in 18 ul of transcription buffer (40mM 
HEPES pH 8.0, lOmM magnesium chloride, 150 mM po- 
tassium glutamate, 2mM dithiothreitol, lOOug/ml bovine 
serum albumin) prior to the addition of 2 ul of a mixture 
of ribonucleotide triphosphates and heparin to a final con- 
centration of 500 uM each for ATP, CTP, GTP and UTP 
and 250 ug/ml heparin. Transcription reactions were 
allowed to proceed for lOmin at 37°C and were stopped 
by addition of NaCl to a final concentration of 0.5 M 
followed by incubation at 70° C for 5min. Samples were 
extracted with a 1:1 phenol-chlorophorm mixture, 
precipitated with ethanol and resuspended in 12 ul TE 
buffer. A volume of 2.5 ul of resuspended samples were 
treated with DNasel (1 U in lOmM Tris-HCl pH 7.6, 
2.5 mM MgCl 2 , 0.5 mM CaCl 2 , in a final volume of 
20 ul) for lh at 37°C, and DNasel was heat-inactivated 
at 65°C for lOmin. Transcript amounts were determined 
by quantitative real-time polymerase chain reaction 
(PCR): 1 ul of DNasel-treated samples was 
retrotranscribed and the cDNAs were amplified in quan- 
titative real-time PCR using the RNA-I transcript as 
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reference gene. The sequences of the primers used in 
real-time PCR are available upon request. 
DNasel-treated transcription reactions not incubated 
with reverse transcriptase were added as negative 
controls for the presence of undigested plasmid DNA. 
To verify that transcripts would indeed originate from 
the promoters of interest, and not from non-specific tran- 
scription initiation events, in vitro transcription start sites 
were determined by Rapid Amplification of cDNA Ends 
(RACE) analysis as described (26): start sites observed in 
the in vitro transcription experiments corresponded to 
those described in the literature (data not shown). 
Site-directed mutagenesis of the ssrS PI promoter was 
performed using the three-step PCR method (27). 

Run-off microarray experiments on E. coli genome 

Genomic DNA from E. coli MG1655 was isolated with 
GenElute Bacterial Genomic DNA Kit (Sigma) and di- 
gested with EcoRI overnight. The size of digested genomic 
DNA ranged from 0.6 to 30 kb as judged from visualiza- 
tion on agarose gels. After purification with Wizard SV 
Gel and PCR Clean-Up System (Promega), 1 ug of 
genomic DNA was used in single transcription assay for 
run-off micro array (ROMA) experiments. RNA polymer- 
ase holoenzyme concentrations were 100 nM in 50-ul 
reaction mixture. Digested genomic DNA and reconst- 
ituted holoenzyme were incubated in transcription buffer 
(40 mM HEPES pH 8.0, 10 mM magnesium chloride, 
150mM potassium glutamate, 2mM dithiothreitol, 
lOOug/ml bovine serum albumin) for 15min at 37°C to 
allow binding of RNA polymerase to DNA. Transcription 
reactions were started with the addition of 2.5 ul of ATP, 
CTP, GTP and UTP to a final concentration of 500 uM 
each, were allowed to proceed for 30min at 37°C and were 
stopped by incubation at 65°C for 5min. The reaction 
mixture was incubated with 10 U of DNasel at 37°C 
for 30min to degrade genomic DNA. RNA was purified 
from reaction with miRNeasy Mini Kit (Qiagen), which 
allows recovering of total RNA, including small RNA 
molecules (<200nt). The quantity of RNA produced by 
in vitro transcription was determined using Nanodrop 
Spectrophotometer, while the average length of the tran- 
scripts, as well as of the corresponding cDNA from 
reverse transcription (see below), was evaluated by capil- 
lary electrophoresis on an Agilent Bioanalyzer 2100 using 
an RNA pico and nano assay, respectively. Capillary elec- 
trophoresis analysis of transcription reactions performed 
in the presence of Ea s , Ea 70 or core RNA polymerase 
clearly showed that, while the average transcript size 
generated by any form of RNA polymerase was compar- 
able, a significantly lower amount of RNA was produced 
in transcription reactions performed with core RNA poly- 
merase alone compared to either form of RNA polymer- 
ase holoenzyme (Supplementary Figure SI). Indeed, 
in vitro transcription reactions performed using RNA 
polymerase core enzyme generated 4ng/ul of transcripts, 
versus 3 1 ng/ul and 23 ng/ul total RNA produced in the 
presence of Ea 70 and Ect s , respectively. Dependence 
on the presence of a cr factor for efficient in vitro transcrip- 
tion strongly suggests that, in our experimental 



conditions, transcription initiation does not originate 
from non-specific initiation sites such as 5' DNA 
termini, but requires interaction of RNA polymerase 
holoenzyme with promoter sequences. Ten independent 
run-off transcription assays with either Ea s or Ect 70 
were performed, and transcripts were pooled together 
(~1 ug total RNA). From the pool of the 10 independent 
multi-round transcription reactions two distinct hybrid- 
izations on microarrays were performed. 

For hybridization onto microarrays, we used the 
Affymetrix Genechip E. coli Genome 2.0 array, which 
includes 10 144 probe sets covering all the predicted tran- 
scripts from four strains of E. coli: the laboratory strain 
MG1655, the uropathogenic CFT073 and the enter- 
opathogenic strain OH157:N7, subtypes EDL933 and 
SAKAI. Due to the high degree of similarity between 
the E. coli strains, typically a single probe set is tiled to 
represent the equivalent orthologue in all four strains, and 
strain-specific probes are only used for genes displaying 
low levels of conservation, or present solely in one 
strain. Probe sets match every open reading frame 
(ORF) in E. coli; in addition, 1427 probe sets targeting 
714£". coli MG1655 intergenic regions, probes for various 
antibiotic resistance markers, and additional control and 
reporter genes from the previous generation E. coli arrays 
are also represented in the Affymetrix Genechip E. coli 
Genome 2.0 array. 

The RNA samples were processed for microarray hy- 
bridization, following the instructions of the GeneChip® 
Expression Analysis Technical Manual (Chapter 5, 6 and 
7, Prokaryotic target Preparation, Hybridization, 
Washing, Staining and Scanning), except that cDNA syn- 
thesis by reverse transcription was performed with 1 ug 
RNA. cDNA was fragmented with DNase I treatment 
and labelled with biotin using Terminal Deoxynucleotidyl 
Transferase. The fragmented and labelled cDNAs were 
then hybridized for 16 h at 45°C on individual E. coli 
Genome 2.0 arrays. After hybridization, GeneChips were 
washed and stained with streptavidin-conjugated phyco- 
erythrin by using the Fluidic Station FS450 (Affymetrix) 
according to the FS450_0006 Protocol. Fluorescent 
images of the arrays were acquired using a GeneChip 
Scanner 3000 7G (Affymetrix). All Chip images and files 
have been deposited in the GEO (Gene Expression 
Omnibus) (http://www.ncbi.nlm.nih.gov/geo/) reposi- 
tory (accession number:GSE22207). After quality con- 
trol of data distribution, the raw data (CEL files) were 
used to perform normalization and probe set summar- 
ization through Robust Multiarray Analysis (RMA) 
algorithm (by using the Affymetrix Gene Expression 
Console Software (www.affymetrix.com). Normalized 
data were also analyzed with OneChannelGUI Software 
(http://www.bioinformatica.unito.it/oneChannelGUI/) in 
order to perform an additional expression analysis 
based on different parameters. OneChannelGUI is an 
add-on Bioconductor package extending the capability of 
the affylmGUI package (28); it is a library providing 
a graphical interface (GUI) for Bioconductor libraries to 
be used for the complete single-channel microarray 
analysis. 
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To perform statistic analysis on the two hybridizations, 
we decided to apply a simple non-parametric statistical 
method based on ranks of fold changes to perform a 
two-class paired differential analysis. To select genes 
with significantly different transcription levels, we set the 
following parameters for the Rank Product analysis: 100 
permutations and 0.1 cut-off percentage of false positives 
(pfp) which corresponds to a P-value <0.01. Fold differ- 
ences higher than 1.5-fold were considered indicative 
of significantly different expression, similar to previous 
ROMA experiments (29): genes more efficiently 
transcribed by either Ea or Ea 70 are listed in Tables 1 
and 2, respectively. Determination of start sites on tran- 
scripts generated in the in vitro transcription assays was 
carried out by RACE analysis (26). 

In order to manage and retrieve microarray data, a 
genome browser was set up (http://155.253. 6. 64/cgi-bin/ 
gbrowse/provakappa/#search). The genome browser is 
based on the Generic Genome Browser (Gbrowse) which 
is a combination of database and interactive webpage for 
manipulating and displaying annotations on genomes. 
This bioinformatic tool allows users to view and navigate 
through the MG1655 genome (GenBank Accession 
Number: U00096) with information about the gene anno- 
tation, coming from the website http://www.genome.wisc 
.edu/tools/asap.htm (downloaded as GFF3 file), and all 
the probe sets contained in the Affymetrix E. coli 
Genome 2.0 Array manually remapped on the genome. 

Determination of r/wS'-dependent gene expression in 
bacterial cells 

Bacterial strains used were MG1655 (wild type) and its 
rpoS mutant derivative EB1.3 (30). Strains were grown 
at 30°C in three different media: the complex Luria 
Bertani (LB) broth, the glucose-based M9Glu/sup 
medium (31) and LB medium diluted 1:4 (LB1/4). The 
LB 1/4 medium was utilized since it was shown to stimu- 
late expression of r/wS-dependent genes such as the csg 
operons (32). Thus, growth in LB 1/4 medium might posi- 
tively affect either a s concentration or a s activity. 
Samples for RNA extraction were taken both in late loga- 
rithmic (OD 60 o = 0.6-0.7) and in late stationary phase 
(overnight cultures, OD 60 o>1.5). RNA was extracted 
using the small RNA miRNeasy Mini Kit (Qiagen), and 
further reverse transcription and cDNA amplification in 
quantitative real-time-PCR were performed as described 
(33). Primer sequences are available upon request. All re- 
actions were performed twice, each time in duplicate, and 
always showed very similar results. The relative amounts 
of the transcripts were determined using 16S rRNA as the 
reference gene ([Ct Gene of interest - Ct 16S ] = ACt). 

RESULTS 

ROMA analysis of <t s - and o 70 -dependent promoters 

To identify sequence determinants that can direct se- 
lective promoter recognition by either the a 70 - or the 
a - associated form of RNA polymerase, we compared 
Ea s and Ea 70 in ROMA experiments (29,34). Run-off 
transcription assays were performed using as DNA 



template the whole genome of E. coli MG1655 after diges- 
tion with EcoRI; cDNAs generated from the transcripts 
were hybridized on microarrays to determine their relative 
amounts. Unlike genome expression studies performed in 
living cells, which do not distinguish between direct and 
indirect effects, such as transcription factor-dependent 
promoter recognition, the ROMA analysis solely detects 
genes whose promoter are recognized by either Ea s or 
Ect 70 (or both) in the absence of any additional factor. 
Out of ~10000 probe sets on the microarray, only 173 
(~1.7%) showed differences in transcription levels 
higher than 1.5-fold, considered to be significant 
(pfp<0.1, /"-value <0.01; see 'Materials and Methods' 
section). For several operons, only a portion of the tran- 
scriptional unit (or even a single gene) showed significant 
difference in transcription levels by either Ea s or Ea 70 : in 
the case of reduced transcription of distal genes within a 
given operon, this effect might depend either on premature 
arrest of transcription by RNA polymerase or on the 
presence of an EcoRI restriction site within the operon, 
since the EcoRI restriction nuclease was used to digest 
chromosomal DNA used in ROMA experiments (see 
'Materials and Methods 1 section). However, the effects 
of EcoRI digestion were not so severe as it could be 
expected, possibly due to the fact that chromosomal 
DNA was only partially digested by the enzyme in our 
conditions. For instance, despite the presence of an 
EcoRI site in the ydhY gene, transcription extending 
into the downstream genes was still detectable (Table 1 
and Supplementary Table SI). In the case of distal genes 
of an operon showing more significant differences in Ea s - 
versus Ea 70 -dependent transcription than genes proximal 
to the promoter, this might be due to either lack of detec- 
tion of proximal genes in the microarray experiment or the 
presence of unidentified promoters internal to the 
operons. We verified that genes within the same operon 
would not show dependence on different forms of RNA 
polymerase: out of the several hundred operons in the 
E. coli genome, only in the weakly Ea 70 -dependent 
atpBEFHAGDC and flgBCDEFGHIJ operons could we 
observe the presence of a single gene showing dependence 
on Ecr s in our ROMA experiments (data not shown). This 
result might suggest the presence of Ea s -dependent pro- 
moters, or promoter-like sequences, within these operons; 
however, we decided to focus our investigation on operons 
consistently showing preferential recognition by either 
form of RNA polymerase, and the atpBEFHAGDC and 
flgBCDEFGHIJ operons were not considered further in 
our study. In Supplementary Table SI, we show the 
ratios of Ea - versus Ea 70 -dependent transcription for 
operons featuring genes preferentially transcribed in the 
presence of Ea s . 

Complete results of ROMA experiments are summar- 
ized in Tables 1 and 2, listing genes and intergenic regions 
transcribed more efficiently either by Ea s (Table 1) or by 
Ea 70 (Table 2). Out of the 54 genes preferentially trans- 
cribed by Ea s in ROMA experiments that are annotated 
in E. coli MG1655, 21 (39%) had already been described 
as r/wS-dependent genes, either from genetic character- 
ization or from microarray experiments comparing wild- 
type and rpoS mutant strains (underlined in Table 1). In 
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Table 1. Genes transcribed more efficiently in the presence of Eo s 



Genes and 
promoter regions" 


Gene product b 


b number" 


Transcription 
levels 
(Ecr/Ecr 
ratio) 


Known regulatory factors 6 


Stress response 








ct e (59); a s (19) 


yieF 


Chromate reductase 


b3713 


1.50 


IG1341353 1341620-r 


Intergenic region including osmB promoter 


N.A. 


1.51 


a s , RcsBA (60) 


osmB 


Osmolarity-inducible lipoprotein 


bl283 


1.51 


a s , RcsBA (60) 


gshA 


y-glutamate-cysteine ligase (glutathione 


b2688 


2.21 






biosynthesis) 








ahpF 


Alkyl-peroxidase reductase 


h0606 


1.52 


DwR (6n 

v y A v iv y\j i j 


pphA 


Protein phosphatase 


D 1 o jo 


i . j4 


rr H tfTi f\T\ 


macA 


Macrolide resistance efflux pump 


b0878 


1.54 




DNA and RNA metabolism and modification 








nnk 
I'l'" 


Polyphosphate kinase, part of RNA 


b2501 


1 50 


1? o (Ti i 1 'i lf\i* f\f i n t ri i*f*l 1 1 1 1 q r rr^ 
l\CgUldLUl Ol 111 LI acciiuidi O 




degradosome, 






cnticpn tru ti on s l3i 

1. V ' 1 IV. .1, 1 1 l 1 il 1 1 1. ' 1 1 > 1 J ( 


IG2522899_2523146-r 


Intergenic region including xapA promoter 


N.A. 


1.62 


XapR (64) 


xapA 


Xanthosine phosphorylase, nucleotide 


b2407 


1.56 


XapR (64) 




synthesis/degradation 








gyrB 


B subunit of DNA gyrase 


b3699 


1.65 


Fis (65) 


recT 


Rac prophage, recombinase, in 


bl349 


1.93 


a s -Dependent in biofilm-growinj 




recET-lar-ydaCQ operon 






cells (18) 


tar 


Rac prophage, restriction alleviation 


bl348 


1.93 






and modification enhancement, in 










recET-lar-ydaCQ operon 








ydaC 


Rac prophase, in recET-lar-ydaCQ operon 


bl347 


1.74 


as recT 


ydaQ 


Rac prophage, possible recombinase, in 


bl346 


1.66 


as recT 




recET-lar-ydaCQ operon 








Polyamine metabolism 










puuB 


y-glutamyl-putrescine oxidase 


bl 301 


1.61 


rr s (19t 


gabD 


Succinate semialdehyde dehydrogenase 


b2661 


1 65 


rr S TSJmp (Q (\(\\ 

o , i>ac (7,ooj 


speB 


Agmatinase (in arginine/putrescine degradation 


b2937 


1.66 






pathway) 








Transcription regulation 










lG5a2284_5o2903-f 


Intergenic region including appY promoter 


N.A. 


1 .69 


TJ MO 1 £H\ 

H-NS (6/) 


appY 


Regulator of hyaABCDEF operon 


b0564 


1.51 


H-NS (67} 


yciT 


Putative deoR-type transcription regulator 


bl284 


1.56 




ilvY 


Regulator of ilvC operon 


b3773 


1.75 




rhaS 


Regulator of rhamnose transport 


b3905 


1.81 


CRP (29V RhaS (68"! 


Sugar metabolism 










treF 


1 1 L 1 Ul lil >L 


b3519 


1.53 


a s (9 191 


ytjr 


Putative galactose ABC transporter 


b4230 


1.55 


a s (19) 


otsB 


Trehalose-6-phosphate phosphatise 


bl897 


1.56 


a s (7) 


araF 


Arabinose transporter, component of an ABC 


bl901 


1 57 


fRD Ai-aC (f>Q\- rr S ( 1 Q\ 




transport system 








glgP 


Glycogen phosphorylase, part of glgCAP 


b3428 


1.60 


CRP C70V rr s controls pIpCAP 




operon 






operon (19) 


gpmM 


Putative 2,3-bisphosphoglycerate mutase 


b3612 


1.61 




glvC 


Sugar transport phosphotransferase 


b3683 


1.79 




Other metabolic functions 










lG1030936_1031361-f 


Intergenic region including hyaABCDEF 


N.A. 


2.02 






promoter 








hyaA 


Hydrogenase small subunit; in hyaABCDEF 


b0972 


1.68 


rr^ A nnY 'A niiprnhip rptri 1 1 a ti nn 

KJ , / V I J If ± , 1 1 £1 w 1 UU1L 1 LllCL Llt_711 




operon 






by ArcA and NarP/NarL 










(20,71,72) 


hyaB 


Hydrogenase large subunit; in hyaABCDEF 


b0973 


1.66 


as hyaA 




operon 








hyaF 


Hydrogenase subunit (nickel-binding protein); 


b0977 


1.50 


as hyaA 




in hyaABCDEF operon 








syd 


SecY-interacting protein 


b2793 


1.59 




murP 


Acetyl-muramic acid permease 


b2429 


1.59 


CRP, MurR (73) 


murR 


Transcriptional repressor of murQP 


b2427 


1.62 




tarn 


Trans-aconitate methylatransferase 


bl519 


1.60 


a s (74) 


ydhY 


Predicted oxidoreductase, Fe-S protein; in 


bl674 


1.80 


FNR, NarL (75) 




ydhYVWXUT operon 








ydhV 


Predicted oxidoreductase, Fe-S protein; in 


b!673 


1.62 


FNR, NarL (75). cr s -dependent 




ydhYVWXUT operon 






in biofilm-growing cells (18) 



(continued) 
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Table 1. Continued 



Genes and Gene product* 5 b number 0 Transcription Known regulatory factors 6 

promoter regions" levels 

(Ea s /Eo 70 

ratio) d 



Unknown and miscellaneous functions 



yccT 


Unknown 


b0964 


1.51 


Induced in stationary phase (76) 


yffP 


Predicted protein, prophage 


b2447 


1.52 




,,;jf 

yiuii 


Putative membrane transporter 


djo /y 


1.53 




yn)D 


Predicted protein 


Dl JOO 


1.54 




yfiL 


Putative lipoprotein 


b2602 


1.55 




yi91a 


Unknown, in CP4-6 prophage sequence 


b0255 


1.56 




yfiL 


Unknown, possible prophage gene 


b2625 


1.57 


s 

CT (9) 


ygL 


Unknown, prophage protein 


b0278 


1.59 


a s -Dependent in biofilm-growing 










cells (18) 


eutA 


Reactivating factor for ethanolamine 


b2451 


1.59 


eutH, in eutHA operon, is 




ammonia lyase 






CT S -dependent (9) 


yi'jG 


Predicted outer membrane protein 


b3524 


1.61 


o s (9) 


vbeH 


Hypothetical protein 


b0625 


1.66 




vedS 


Unknown 


bl964 


1.69 




yfgJ 


Unknown, mutant affecting swarming motility 


b2510 


1.74 




G7353 


Phantom gene 


b2596 


1.79 


Upstream of a ribosome 










modulation factor induced 










in stationary phase (yfiA) 


yqiG 


Unknown, interrupted by IS element 


b3046 


1.81 




IG1006824 1007066-r 


Intergenic region including ycbX promoter 


N.A. 


2.26 




ycbX 


Unknown 


b0947 


2.05 




yM 


Unknown 


b2623 


2.30 




IG2755422 2755664-r 


Intergenic region downstream of yfjH 


N.A. 


2.01 




ychS 


Unknown 


bl228 


3.21 




Non-coding RNAs 








CRP, a E (77,78) 


ryeE (cyaR) 


Small RNA, promotes degradation of ompX 


b4438 


1.50 




and nadE RNA 








sgrSjsgrT 


Small RNA, inhibits ptsG translation/SgrT 


b4577 


1.51 


SgrR protein 




protein 








ECs3934 


sibDjsihE non coding RNA- ibsDjibsE 


b4447 


1.62 


Complex locus including two 




toxic peptides 


b4664 




non coding RNAs overlapped 










by two small ORFs encoding a 










putative toxin/antitoxin system 


micA 


micA small RNA (downregulates ompA 


b4442 


2.00 






expression) 








Intergenic regions, genes 


not annotated in MG1655 








c387S 


Unknown, annotated in CFT073 


N.A. 


1.50 




IG2922538_2922756-f 


Intergenic region between syd (predicted 


N.A. 


1.50 






protein) and csrB (ncRNA), antisense 








c5008 


Unknown, annotated in CFT073, 


N.A. 


1.50 






downstream of malM 








IG2438141 2438404-f 


Between fabB and ycfj, antisense 


N.A. 


1.50 




ECs5537 


Unknown, annotated in 0157:H7 SAKAI 


N.A. 


1.51 




IG2885243_2885600_r 


Intergenic region, upstream of predicted 


N.A. 


1.52 






helicase ygcB 








c4656 


Unknown, annotated in CFT073 


N.A. 


1.52 






(antisense of atpC) 








IG3665211 3665420-f 


Intergenic region upstream of gad A, antisense 


N.A. 


1.52 




clOW 


Unknown, annotated in CFT073, upstream 


N.A. 


1.52 


aqpZ (aquaporin) is a s -dependent 




of aqpZ 






(18,79) 


c0723 


Unknown, annotated in CFT073, upstream of 


N.A. 


1.54 






dacA, transcribed in opposite direction 








1G2481360J481774-I- 


Intergenic region upstream emrKY 


N.A. 


1.54 






(TolC-multidrug efflux pump) 








IG2228406_2228643-f 


Intergenic region upstream of yohJK (inner 


N.A. 


1.54 






membrane proteins) 








IG2201932 2202549-r 


Intergenic region between yehl-yehK, antisense 


N.A. 


1.54 




c2806 


Unknown, annotated in CFT073 


N.A. 


1.55 






(antisense of menB) 








c3010 


Unknown, annotated in CFT073, upstream of 


N.A. 


1.55 






perM gene, transcribed in opposite direction 








IG3669525 3669971-r 


Intergenic region, upstream of yhjB 


N.A. 


1.58 




IG2898371 289861 3-f 


Intergenic region upstream of yqcE 


N.A. 


1.60 




c5221 


Unknown, annotated in CFT073 


N.A. 


1.63 






(antisense of aspA) 








lG223409_223770-r 


Intergenic region upstream of rrsH ribosomal 


N.A. 


1.64 





operon (transcribed in opposite direction) 
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Table 1. Continued 



Genes and 


Gene product b 


b number 0 


Transcription 


Known regulatory factors 6 


promoter regions' 1 






levels 

(Eo- s /Eo- 70 

ratio) d 




IG1420MI 142tf)SX-r 

IUJtaI/OJ 1 mJ TiJ l/JO 1 


Tntcrffpnir' i*p<tii"M"i Hr^ix/nctri^'iTn r\T vrf t<' i*i nASA. 

111 LCI gCilJL, ICgltJll, LIU W 115 LI Lillll Ul II jl 1 1 UU5U 

mal o per on 


N.A. 


1.66 






Unknown, annotated in CFT073 (antisense of 
rplL) 


XT A 
1> ./\. 


1 A7 
1.0/ 




IGldlRIRd 2d290d1-f 

fu £1 £Q / Of- ii'rA jruti j 


111 LCI gGlllL. IGJiltJll, LlUSLlCcllll Lf 1 L\'Ur\ i^LllL, 

transcribed in opposite direction 


N.A. 


1.71 




Li yuo 


Unknown, annotated in CFT073 
(antisense of yddM) 


rN ./\. 


1 71 




c0703 


Unknown, annotated in CFT073 

f QtiticPticp rit /~ , itf-i'\ 
y txii Liaciisc *ji Liti j 


N.A. 


1.74 




ECs5165 


yjfO (biofilm-related protein) in 0157:H7 
SAKAI 


N.A. 


1.86 




IGlQftllRd 1001567 f 
I\i I y(/JZo*_i yujJO/-l 


Intergenic region downstream of yobD, 
transcribed in opposite direction 


M A 


1 87 
1 ,o / 




IG3358642 3358810-f 


Intergenic region downstream of gltD 


N.A. 


1.89 




IG2190243 2190534-r 


Intergenic region downstream of yehE 


N.A. 


1.94 




c3113 


Unknown, annotated in CFT073 (upstream 
of rrsG ribosomal operon, transcribed in 
opposite direction) 


N.A. 


1.95 




c2317 


Unknown, annotated in CFT073 
(antisense of azuC) 


N.A. 


1.99 


Complex locus including 
small RNA isrB 


IG2755422 2755664-r 


Intergenic region downstream yfjH 


N.A. 


2.01 




IG2519349 2519612-f 


Intergenic region downstream of xapR 


N.A. 


2.05 




IG330721 331594-r 


Intergenic region upstream of yahA, 
transcribed in opposite direction 


N.A. 


2.07 





"For known genes, we used the nomenclature reported in the NCBI database (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi); intergenic region are 
indicated using the nomenclature by Affymetrix, which indicates the start and the end of the intergenic region (IG) covered by the various probe sets; 
-f or -r indicate if the probe sets are in forward or reverse orientation relative to the (+) strand of the E. coli chromosome; genes already described as 
/ycS-dependent in vivo are underlined. 
b Gene product (or predicted product). 

^Relative location of the various ORFs on the E. coli MG1655 chromosome. 
d Determined by microarray analysis as described in 'Materials and Methods' section. 
e When not otherwise stated, the information is taken from http://ecocyc.org/. 



contrast, no known r/?oS-dependent gene was found to be 
preferentially transcribed by Ea 70 (Table 2). However, 
several genes clearly identified by previous works as 
r/wS-dependent (e.g. katE, dps and gadA) only showed 
slight (< 1.5-fold) preferential recognition by Ect in our 
experimental conditions (Supplementary Table S2). This 
would suggest that, for these rpoS-dependent genes, 
promoter recognition by Ea s might be mediated by regu- 
latory proteins, or facilitated by additional factors such as 
DNA supercoiling or effector molecules (e.g. ppGpp), 
missing in ROMA experiments. Alternatively, in ROMA 
experiments, Ea 70 might recognize promoters which might 
not be accessible to this form of RNA polymerase in vivo, 
due to selective negative recognition by regulatory 
proteins such as H-NS (11,12). 

A significant fraction of genes more efficiently transcri- 
bed by Ect s is involved in sugar and in polyamine metab- 
olism, in response to cellular stresses and in nucleic acid 
synthesis, modification and turnover (Table 1), consistent 
with the role of a s as a starvation- and stress-related 
protein; no essential gene was found. Interestingly, 
several Ea s -dependent transcripts correspond to non- 
coding RNAs, intergenic regions and ORFs of unknown 
function only annotated in pathogenic E. coli strains. This 
last result was surprising, since the DNA template used in 



the in vitro transcription experiments came from E. coli 
MG1655; however, probe set analysis and sequence com- 
parison performed using BLAST (http://blast.ncbi.nlm 
.nih.gov/Blast.cgi) revealed that these ORFs are also 
present in MG1655, although they are not annotated in 
this strain. Several of these unlisted ORFs overlap known 
genes in an antisense direction, as indicated in Table 1. 
Interestingly, one of such ORFs (c3113, annotated in the 
uropathogenic strain CFT073), as well as one intergenic 
region (IG223409_223770-r), is located in the promoter 
regions of ribosomal operons rrsG, and rrsH in an anti- 
sense direction, possibly suggesting that they may play a 
role in Ecr s -dependent control of ribosomal operon 
transcription. 

In contrast, Ect 70 appeared to transcribe with higher- 
efficiency genes encoding ribosomal proteins and other 
protein synthesis-related genes, such as rRNA- and 
tRNA-encoding genes and prfB, encoding for release 
factor 2 (Table 2). Several of the protein synthesis-related 
genes listed in Table 2 are essential, as are the yjeE and the 
amiB genes, part of a multifunctional operon also prefer- 
entially transcribed by Ea 70 . Interestingly, regulatory 
genes directly affecting a 70 activity, such as relA, respon- 
sible for biosynthesis of the ppGpp alarmone in response 
to amino acid starvation (35), and ssrS, encoding a 6S 
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Table 2. Genes transcribed more efficiently in the presence of Ea 



Genes and 
promoter regions" 


Gene product b 


b number^ 


Transcription 
levels 

fFc 70 /Fo s 

ratio ) d 


Known regulatory factors 6 


Ribosomal genes and protein synthesis 








IG1286552_1286760-r 


Intergenic region including tyrT promoter 


N A 


9 Q7 


rid (ou) 


rttR 


Non-coding RNA, part of tyrT transcript 




1 Q7 


FTC (Q(Y\ 


tpr 


Small protamine-like protein part of tyrT 


bl 229 


2.03 






transcript 








serT 


Serine tRNAs gene 


b0971 


1.89 


FTS 1 7Q V ii nrptJi il a tpd in rnnS! 










mutant of MG1655 CI 71 

111 LI LCI 111 \J 1 1V± VJ l\J *J ~J 11 / I 


rpml 


Ribosomal protein L35 


b 1 7 1 7 


1.91 




pifB 


Release Factor RF2 


b2891 


1.91 


T Ttirptn ll a t pri in a r*irifilm-»Tt"r\\\/in <t 

L_J LIl CgLlltl LCVJ. Ill tl LJltJllllll gl v_J Wlilii 










1 pUO I11U Ld.Il L U.C1 1 Vd. LI VC UI 










IVIO 1 OJ J ^ 1 a) 


rimP 


Ribosomal maturation protein 


t>3 1 70 


2 01 




rpsF 


Ribosomal protein S6 


b4200 


2.04 




rpmG 


Ribosomal protein L33 




9 1 n 

Z. 1 u 




relA 


ppGpp alarmone biosynthetic enzyme 


b2784 


2.35 




rplU 


Ribosomal protein L21, in rplU-rpmA operon 


b3186 


2 50 




rpmA 


Ribosomal protein L27, in rplU-rpmA operon 


b3185 


2 58 




rpsU 


Ribosomal protein S21 


b3065 


2.78 


Upregulated in a biofilm-growing 










fim V mutant Hprivativp nf 

t IJUtJ UlLLLcXllL 1 M ill! V L K_f I 










MG1655 (18"! 

1V1 V J 1UJJ 1 1(J 1 


IG3426400J426656-I- 


Intergenic region including rmH promoter 


N A 


2 87 




DNA repair 










sul A 


SOS response inhibitor of cell division 


b0958 


2.04 


Lex A (81) 


dinl 


AP endonuclease, SOS response 


bl061 


2.18 


LexA (82)^ upregulated in an 










mn V mutant Hpri vativp nf 

1 IJUtJ UlLLLcXllL 1 M il u V L K_f I 










OH157:H7 EDL 933 (19) 


Multifunctional operons 










cvpA 


Colicin V production; in cvpA-pitrF-ubiX 


b2313 


1 85 


r LL1 1\. yo J lipi LllclLCLl 111 elll 




operon 






vpoS mutant derivative of 










OH1 S7- H7 FF)T Q^H (^9^ 

VJlll J / .11 / 1 ,l7L 7JJ 1 l7j 


purF 


Amidophosphoribosyl transferase 


b2312 


2.12 


oe evil A 




(ribonucleotide metabolism); in 










cvpA-purF-ubiX operon 








ubiX 


3-octaprenyl-4-hydroxybenzoate decarboxylase 


b231 1 


1 on 


as c vp A. 




(ubiquinone biosynthesis); in cvpA-purF-ubiX 










operon 








c2854 


Unknown, annotated in CFT073 as part of the 


N A 


9 00 






cvpA-purF-ubiX operon 








yjeF 


Putative carbohydrate kinase; in 


b4167 


1.88 






yjeFE-amiB-mutL operon 








yjeE 


Essential protein with weak ATPase activity; in 


b4168 


1.95 






yjeEF-amiB-mutL operon 








amiB 


yV-acetylmuramyl-L-alanine amidase needed for 


b4169 


2.02 






septum formation during cell division; in 










yjeEF-amiB-mutL operon. 








mutL 


Methyl-directed mismatch repair, subunit in 


b4170 


9 06 






MutHLS complex; in yjeEF-amiB-mutL 










operon. 








yhhE 


Inner membrane protein; in yhbE-obgE operon 


b3184 


2.36 




ohgE 


GTP-binding protein, involved in ppGpp 


b3183 


1.91 






turnover; in yhbE-obgE operon 








Transcription regulation 










IG3717398J 717677-f 


Intergenic region including cspA promoter 


NT A 


1 8^ 

1 .OJ 




cspA 


Cold shock protein A 


b3556 


2 34 


UpiC^UlcllCLl 111 d.11 / pUO UllLLclIll 










derivative 01 wriij/.ri/ iiljl^ 










933 (19) 


alpA 


CP4-57 prophage gene, regulator of tmRNAs 


b2624 


2.31 




Metabolic functions 










fhuF 


Iron reductase 


b4367 


1.83 


Fur, OxyR (84) 


ydhR 


Putative monooxygenase 


bl667 


1.91 




artJ 


Arginine transporter 


b0860 


2.13 


Upregulated in a biofilm-growing 










rpoS mutant derivative of 










MG1655 (18) 


Unknown and miscellaneous functions 








ycgY 


Unknown 


bl!96 


1.83 
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Table 2. Continued 



Genes and 
promoter regions" 



Gene product* 3 



b number 0 



Transcription 
levels 
(Es 70 /Es s 
ratio) d 



Known regulatory factors 6 



b2017 


1.85 


b2119 


1.92 


U 1 8 0 1 
D 1 oZ 1 


Z.UO 


bl705 


2.10 


b4434 


2.25 


b3864 


2.72 


DZ? 1 1 


1 1A 


XT A 
IN ./\. 


1 8"* 

1 .OJ 


N.A. 


1.85 


XT A 


1 fi£ 


N.A. 


1.92 


N.A. 


1.92 


N.A. 


1.94 


XT A 
IN . . \ . 


1 Q£ 

1 .7U 


XT A 
IN ./\. 


1 QS 

1 .70 


XT A 
IN . . \ . 


Z.U3 


XT A 
IN . . \ . 


Z. 1 J 


N.A. 


2.16 


XT A 
IN .A. 


z.z / 


N.A. 


2.31 


N.A. 


2.35 


N.A. 


2.42 


N.A. 


2.44 


N.A. 


2.49 


N.A. 


2.54 


N.A. 


2.99 


N.A. 


3.10 


N.A. 


3.45 


N.A. 


4.02 


N.A. 


4.44 



yefM 
yehL 

yehN 
ydiE 

Non-coding RNAs 
isrB 

ssrS 

Intergenic regions, genes 
cl714 

IG330721_331594-f 
ECsl613 

IG2424809_2425028-r 
IG1120179_1120464-r 

c2481 

IG127588_127911-f 

IG2404662_2405580-r 

Z0043 

c2568 

Z5055 

c4052 

IG2815526J815805-I- 

IG583654J83902-I- 

c2230 

IG 19056 16 J906284-f 
Z3239 

c4719 
c4352 

IG2428784_2429041-f 

cl434 

Z5945 

Z5868 



Antitoxin in yefM-yoeB toxin-antitoxin system 
Unknown, possible component of ABC 

transport system 
Unknown, putative membrane protein 
Putative lipoprotein 

Small RNA 

Small RNA, regulates DNA polymerase I activity 
6S RNA, modulates cr 70 activity 

not annotated in MG1655 

Unknown, annotated in CFT073, upstream of 
els, transcribed in opposite direction 

Intergenic region including yahA promoter 
region 

Unknown, annotated in 0157:H7 SAKAI, 

possible prophage gene (renD) 
Intergenic region between argT and hisJ 
Intergenic region upstream bssS (regulator of 

biofilm formation) 
Unknown, annotated in CFT073, upstream of 

cobU 

Intergenic region upstream of Ipd 
Intergenic region upstream of IrhA 
Annotated in 0157:H7 OH157:H7 EDL 933, caiC 
Annotated in CFT073, wcaM 
Annotated in 0157:H7 OH157:H7 EDL 933, 
rfaG 

Unknown, annotated in CFT073 
Intergenic region upstream yqaBA 
Intergenic region downstream of ompT 
Unknown, annotated in CFT073 (antisense of 
cspC) 

Intergenic region, between vobF and yebO 
Annotated in 0157:H7 OH157:H7 EDL 933 

(antisense of yegl) 
Annotated in CFT073 homologous to aslA 
Annotated in CFT073 as dppC 
Intergenic region, between yobD and yebN 
Unknown, annotated in CFT073, vdfR 
Annotated in 0157:H7 OH157:H7 EDL 933 

(toxic peptide) 
Annotated in 0157:H7 OH157:H7 EDL 933; 

yjgM, putative acetyltransferase 



Upregulated in a ipoS mutant 
derivative of MG1655 (20) 



"For known genes, we used the nomenclature reported in the NCBI database (http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi); intergenic region are 
indicated using the nomenclature by Affymetrix, which indicates the start and the end of the intergenic region (IG) covered by the various probe sets; 
-f or -r indicate if the probe sets are in forward or reverse orientation relative to the (+) strand of the E. cob chromosome; genes previously described 
as being upregulated in an ipoS mutant strain are double underlined. 
b Gene product (or predicted product). 

^Relative location of the various ORFs on the E. coli MG1655 chromosome. 
d Determined by microarray analysis as described in 'Materials and Methods' section. 
e When not otherwise stated, the information is taken from http://ecocyc.org/. 



RNA able to modulate Ecr -dependent at several pro- 
moters (36), showed dependence on Ect 70 in ROMA ex- 
periments (Table 2). For ssrS, our data confirm literature 
data showing that the main ssrS promoter (ssrS PI) is 
strictly Ecr 70 -dependent in vitro (37). Interestingly, 
several genes preferentially transcribed by Ect 70 in 
ROMA experiments are upregulated in rpoS mutant 
strains (underlined in Table 2); it has been proposed that 
negative regulation by a s can depend on competition with 



a for a limiting amount of RNA polymerase core 
enzyme, which results in lower intracellular Ecr 70 concen- 
trations and, in turn, in impaired transcription initiation 
at strictly a 70 -dependent promoters (38). 

In vivo validation of Ea s -dependent genes identified in 
ROMA experiments 

In order to validate the results of ROMA experiments, we 
tested in vivo expression of 18 genes preferentially 
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transcribed by Ea s , comparing their transcript levels in 
MG1655 and in its rpoS mutant derivative EB1.3. 
Transcript levels were determined by quantitative real- 
time PCR. Different genetic backgrounds (MG1655, 
MC4100 and OH157:H7 EDL933 strains) and different 
growth conditions (LB medium, glucose-based medium 
and biofilm growth) can strongly affect r/wS-dependent 
gene expression (9,17-20), suggesting that most genes be- 
longing to the a s regulon are also subject to additional 
forms of gene expression regulation. Thus, in vivo gene 
expression was determined in three different growth 
media: the peptone-based LB medium, either full 
strength or diluted 1:4, and the glucose-based M9Glu/ 
sup medium (Table 3). In addition, we took samples 
from cultures both in late exponential phase and in late 
stationary phase of growth; indeed, although a s -mediated 
gene expression is typically associated with stationary 
phase, several r/wS'-dependent genes are maximally 
expressed at the transition between exponential and 
stationary phase or even in mid-exponential phase 
(19,39; Landini,P., unpublished data). 

For five genes (recT, hyaA, gabD, puuB and treF), de- 
pendence on a functional rpoS gene has been described 
(Table 1); however, for the recT gene, part of the 
recET-lar-ydaCQ operon, both positive and negative 
control by rpoS was reported (17,18). As a positive 
control, we tested the expression of dps that, although 
showing only weak dependence on Ecr s in our ROMA 
experiments (Supplementary Table S2), has consistently 
been described as r/wS'-dependent in vivo in several 



Table 3. Gene expression in bacterial cells 



Gene 




LB 


M9Glu/sup 


LB 1/4 


Exp 


Stat 


Exp 


Stat 


Exp 


Stat 


ychS 


2.4 


0.1 


1.1 


6.7 


1.7 


2.0 


gshA 


1.0 


1.5 


1.0 


1.1 


1.9 


3.3 


ycbX 


1.1 


0.5 


0.7 


2.0 


1.5 


2.6 


rec T 


0.9 


0.4 


0.7 


2.5 


1.8 


2.0 


rhaS 


1.8 


0.3 


1.5 


2.7 


1.5 


2.7 


cyaR 


1.0 


1.7 


1.5 


10.0 


0.8 


0.6 


ilvY 


1.7 


0.3 


1.0 


10.2 


1.9 


2.2 


hyaA 


1.2 


5.6 


2.1 


2.9 


2.3 


17.5 


speB 


1.5 


1.3 


1.4 


2.7 


1.2 


86.3 


gyrB 


1.4 


0.8 


0.9 


2.8 


1.0 


3.1 


gabD 


4.8 


0.7 


12.1 


1.2 


5.1 


69.0 


ydhY 


1.5 


0.3 


0.9 


19.2 


1.1 


2.1 


appY 


1.2 


0.7 


0.9 


3.4 


2.0 


11.4 


puuB 


1.7 


0.2 


0.6 


3.5 


3.0 


1.8 


treF 


1.5 


3.(1 


7.2 


0.6 


2.1 


52.6 


bsmA 


1.1 


1.0 


2.3 


0.9 


1.0 


2.1 


xapA 


2.4 


0.2 


0.8 


4.3 


1.1 


2.7 


cHB 


1.0 


1.4 


1.3 


8.1 


2.7 


1.3 


dps 


0.9 


10.7 


10.7 


0.3 


14.2 


30.3 


cspA 


1.3 


0.2 


1.1 


0.8 


0.7 


0.2 



Relative expression is indicated as WT/rpoS ratio. 

Values higher than 2.5-fold were considered significant and are shown 

in boldface type. 

Values are the average of two independent experiments performed in 
duplicate. 



reports (9,16,17); in agreement with the literature data, 
dps expression showed strong r/wS-dependence in all con- 
ditions tested (Table 3). We also tested the expression of 
cspA, which was transcribed with higher efficiency by Ect 70 
in ROMA experiments (Table 2); consistent with this 
result, in vivo gene expression studies show that a func- 
tional rpoS allele is not required for cspA expression, 
which is in fact upregulated in the rpoS mutant when 
grown in peptone-based media (Table 3), in agreement 
with previous observations (20). For in vivo gene expres- 
sion experiments, we considered as significant a fold dif- 
ference >2.5 in WT versus rpoS mutant relative expression 
ratio. 

As shown in Table 3, growth conditions strongly 
affected gene expression: in LB medium, expression of 
only three genes {hyaA, gabD and treF) was dependent 
on a functional rpoS allele, while seven genes were in 
fact upregulated in the rpoS mutant, suggesting negative 
control by ct s . In contrast, 12 genes were expressed in an 
r/wS-dependent fashion when bacteria were grown in 
1:4 diluted LB, while 15 genes showed dependence on 
rpoS in M9Glu/sup medium (Table 3). Of all genes 
tested, only the bsmA gene did not show relative expres- 
sion values >2.5 in any growth condition, while recT and 
ycbX only displayed weak dependence on a functional 
rpoS allele (2.5- and 2.6-fold increase in one growth con- 
dition). All other genes showed either strong dependence 
on the rpoS gene (up to 86.3-fold for speB in LB1/4 
medium) or were affected by the rpoS mutation in more 
than one growth condition (e.g. rhaS, induced 2.7-fold in 
the wild-type strain both in M9Glu/sup and in LB 1/4 
growth media). Finally, in vivo gene expression experi- 
ments demonstrated that the c3113 gene, tested as a rep- 
resentative of ORFs only annotated in the uropathogenic 
E. coli CFT073, is indeed expressed in MG1655, and it 
shows dependence on the rpoS gene when bacteria are 
grown in M9Glu/sup (Table 3). 

In vitro transcription assays on single promoters 

To further confirm the results of ROMA experiments, we 
performed in vitro transcription assays comparing Ect s 
and Ect 70 on single promoters. To this aim, promoter 
regions of the ilvY, speB, xapA, ydhY and ssrS genes 
were cloned into the pJCDOl vector (22) and the 
obtained plasmids were used as DNA template for 
in vitro transcription. The ilvY, speB, xapA and ydhY 
genes were selected since their promoter region has been 
experimentally identified and they show r/wS-dependence 
in vivo (Table 3). In contrast, ssrS shows the highest de- 
pendence on Ect 70 in ROMA experiments among genes 
whose promoter has been characterized (Table 2). 
Strong dependence on Ect 70 is in agreement with 
previous results showing that ssrS PI, the main ssrS 
promoter, is Ea 70 -dependent in vitro (37), and consistent 
with upregulation of ssrS in an rpoS mutant of MG1655 
(20). Transcription of the different genes of interest was 
normalized to the RNA-I transcript, as previously 
described (22), and transcript quantitation was performed 
by real-time PCR as described in 'Materials and Methods' 
section. 
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As shown in Figure 1 , in vitro transcription experiments 
performed on plasmids showed that the ilvY, speB, xapA 
and ydh Y are transcribed more efficiently in the presence 
of Ea s . Extent of dependence on Ea s was higher than in 
ROMA experiments, ranging from a 3.4-fold difference 
for speB to a 10.2-fold difference for xapA, thus suggesting 
that, at least at these promoters, DNA supercoiling does 
not negatively affect promoter recognition by Ea s . In 
contrast, the ssrS PI promoter showed clear dependence 
on Ea 70 , in agreement with previous observations (37). 
Thus, results of in vitro transcription experiments per- 
formed on single promoters were fully consistent with 
ROMA experiments. 

Sequence elements involved in specific Ea s - versus Ea 70 
recognition of promoter regions 

The genes identified in ROMA experiments define, at least 
partially, what can be considered as the 'core a s regulon', 
i.e. a set of genes whose transcription is directly controlled 
by Ea s . The core a s regulon can be opposed to the 
'expanded a s regulon', i.e. genes dependent on a function- 
al rpoS allele in vivo, whose promoters are, however, not 
necessarily recognized by Ea s . In order to identify 
sequence features important for specific promoter recog- 
nition by Ea s , it can be very informative to compare the 
sequences of Ea s -dependent promoters; however, it is im- 
portant to limit this comparison to the promoters that are 
exclusively, or at least preferentially, under Ea control. 
Thus, we performed a sequence alignment on the 
promoter sequences of genes preferentially transcribed 
by either Ea or Ea 70 . Known promoter sequences were 
retrieved from the Ecocyc database (http://ecocyc.org/): 
only promoters whose transcription initiation start site 
had been experimentally determined were considered for 
sequence alignment. However, since many /-/wS-dependent 
genes are controlled by multiple promoters in vivo, 



suggesting complex regulation that might involve different 
sigma factors, we verified that their transcription start sites 
in the in vitro transcription reactions did indeed corres- 
pond to the transcription sites reported in the literature. 
Thus, using RACE analysis, we determined transcription 
start sites on in vitro transcription assays carried out with 
Ea s and performed as in the ROMA experiment (data not 
shown): we were able to identify precisely the in vitro tran- 
scription start sites at 31 promoter regions controlling 29 
different genes, as listed in Supplementary File SI. For 
most genes with already known promoter regions, we 
could confirm the transcription start observed in vivo, 
although for several genes reported to be controlled by 
multiple promoters, only one promoter was found to be 
recognized by Ea s in vitro (e.g. osmB, otsB, glgC and 
murQ). For the gabD gene, two of the three promoters 
described as functional in the bacterial cell were also 
recognized by Ea s in the in vitro transcription assays. In 
contrast, for the araF gene, we identified a second 
promoter additional to the one already described in the 
literature. Finally, we determined the transcription start 
sites for eight genes (or operons) preferentially transcribed 
by Ea s in ROMA experiments with yet-unknown tran- 
scription start sites, namely puuCBE, tarn, treF, yciT, 
yffOP, yhjG, yi91a and yqiG: their transcription start site 
and putative promoter elements are listed in 
Supplementary File SI. For Ea 70 -dependent promoters, 
we verified the transcription start sites for the ssrS tran- 
script obtained in ROMA experiments performed with 
Ea °: the transcription start site matched the known tran- 
scription site for the ssrS PI promoter previously 
identified (37). 

For promoter sequence comparison, we considered 
DNA sequences extending from —100 to +2 bp relative 
to the transcription start site, i.e. an area that includes 
all promoter elements described for Ea 70 . For sequence 
analysis, we divided the —100 to +2 promoter sequence in 
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Figure 1. In vitro transcription experiments on supercoiled plasmids. Transcription from the ilvY, speB, xapA, ydhY and ssrS PI promoter regions 
cloned into pJCDOl plasmid was performed in the presence of either Ea s or Ect 70 ; transcript amounts were determined by quantitative real-time 
PCR as described in 'Materials and Methods' section, using the RNA-I transcript as reference as previously described (22). Relative transcript levels 
are shown as Ecr s /Ea 70 ratio. Experiments were performed three times in duplicate, and standard errors are shown. 



Nucleic Acids Research, 2011, Vol. 39, No. 13 5349 



three parts: the —17 to +2 region, carrying the —10 element 
and the transcription start site; the —60 to —18 region, con- 
taining the UP element and the —35 sequence; and the —100 
to —61 region. Alignment of the —17 to +2 regions was 
centred on the first nucleotide of the —10 element (conven- 
tionally referred to as the —12 position, Figure 2B). For the 



—60 to —18 regions, the alignment was centred either on 
the first nucleotide of the —35 element (when present) or 
on the nucleotide located 22 bp upstream of the —10 
element, which would correspond to the first nucleotide 
of a hypothetical —35 element placed at the optimal 17 bp 
distance from the —10 sequence (Figure 2A). In total, 
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Figure 2. Conserved sequence features in promoters of genes showing preferential transcription by either Ea or Ea , shown as a sequence logo 
derived from multiple sequence alignments. (A) Comparison of alignments in the —60 to —18 promoter regions. (B) Comparison of alignments in the 
— 17 to +2 promoter regions. Note that different j-axis scales were used in the two panels to account for the different levels of sequence conservation. 
Multiple alignment included 31 promoters controlling genes preferentially transcribed by Ea in ROMA experiments (listed in Supplementary File 
SI) and 29 promoters preferentially recognized by Ea 70 (listed in Supplementary File S2). 
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31 promoters showing preferential recognition by Ect s 
(listed in Supplementary File SI) and 30 promoters pref- 
erentially recognized by Ect 70 (Supplementary File S2) 
were selected for sequence analysis. Consensus sequence 
conservation within promoters preferentially recognized 
by either Ect s or Ect 0 was displayed as sequence logos 
using the Weblogo 2.8.2 application (40) (http:// 
weblogo.berkeley.edu/). Results of sequence analysis 
showed that, as expected, the most conserved sequences 
corresponded to the known promoter elements and were 
comprised between —60 and +2 (Figure 2). Comparison of 
the sequences located further upstream (—100 to —61) is 
shown in Supplementary Figure S2. 

For promoters more efficiently recognized by Ect 70 , the 
—35 and —10 sequences, i.e. the most important Ecj 70 -de- 
pendent promoter elements, are clearly recognizable in the 
sequence logos (Figure 2). In addition, the —60/— 40 
region is characterized by a high occurrence of A and T 
residues (Figure 2A), consistent with strong conservation 
of the UP element, i.e. a binding site for the a subunit of 
RNA polymerase (41). Other than the already known 
promoter features for ct 70 , only the transcription start 
site (— 3/+1) showed a moderately conserved sequence 
(CCCG, Figure 2B). 

Several differences in conserved regions were detectable 
in the promoter set for genes more efficiently transcribed 
by Ect in ROMA experiments. The —10 sequence is 
clearly the main conserved promoter element in this set 
(Figure 2B). No conserved sequences were detectable in 
the —35 region (Figure 2 A), in agreement with previous 
works reporting that the —35 element does not play an 
important role in promoter recognition by Ect s (42,43), 
while a weakly conserved sequence similar to a —35 
element seems to be located at around —30 (Figure 2A). 
Another difference between the two promoter sets resides 
in the lack of an A/T rich region between —60 and —40 
(Figure 2A). In contrast, however, promoters more effi- 
ciently transcribed by Ect s seem to possess an increased 
occurrence of T residues in the —90 to —70 region 
(Supplementary Figure S2). Finally, the discriminator, 
i.e. the sequence located between the —10 element and 
the transcription start site, appears to be biased towards 
a high A/T content in promoters transcribed more effi- 
ciently by Ect s . In particular, T residues appear to be 
conserved at positions —6, —2 and —1. Rather than 
being associated to specific nucleotide position, however, 
the bias towards an A/T-rich discriminator seems to be a 
common feature in promoters of genes preferentially 
transcribed by Ect s . Indeed, while the average length of 
the discriminator is identical for both Ect 70 - and Ecj s -de- 
pendent promoters (6.1 bp, Supplementary Table S3), the 
GC content is significantly higher in Ecj 70 -dependent pro- 
moters (0.58 versus 0.41, Supplementary Table S3). 

Finally, an additional deviation between Ect °- and 
Ecj s -specific promoter elements could be observed imme- 
diately upstream of the —10 hexamer: the ECT 70 -dependent 
promoters showed some conservation of a TGG motif 
upstream of the —10 [the 'extended —10' (44)], which 
was replaced by a TNGC motif, which includes the char- 
acteristic — 13C element, in ECT S -dependent promoters. 



Mutations in the discriminator region affect selective 
promoter recognition at the Eo 70 -dependent 
promoter ssrS PI 

It might be inferred that differences in the specific 
elements of the promoter regions might account for dif- 
ferent transcription efficiency by either Ect s or Ect 70 in 
ROMA experiments. In particular, we investigated 
whether the presence of an A/T rich region immediately 
downstream of the —10 element, conserved among pro- 
moters preferentially recognized by Ect s in ROMA experi- 
ment (Figure 2B), could indeed be a determinant for 
selective promoter recognition by Ect s . To test this possi- 
bility, we targeted for mutagenesis the ssrS PI promoter, 
which shows preferential recognition by Ect 0 in our 
in vitro assays (Table 1 and Figure 1) and whose depend- 
ence on Ect had already been reported in the literature 
(37). We changed the CG nucleotides at positions —3/— 2 
of the ssrS PI promoter to TA (Figure 3), thus making the 
— 6/+1 region of the promoter A/T-rich. In vitro transcrip- 
tion experiments on supercoiled templates in the presence 
of either Ect s or Ect 70 showed that CG to TA substitutions 
at positions —3/— 2 of the ssrS PI promoter resulted in loss 
of specific recognition by Ect 70 from 3.5- to 1.6-fold 
(Figure 3), suggesting that the A/T content in the — 6/+1 
region can indeed play a role in modulating transcription 
efficiency by either Ect s or Ect 70 . 

DISCUSSION 

In this work, we have attempted to identify bona fide Ect s - 
dependent promoters through in vitro transcription experi- 
ments using the whole E. coli genome as template, 
followed by identification of Ecr s -dependent transcripts 

-10 +1 
GTGG TAGAGT AACCGT G ssrS PI WT 
IA ssrS PI mut 



4 , ssrS PI WT ssrS PI mut 




Figure 3. In vitro transcription experiments on pJCDOl plasmid deriva- 
tives in which either the ssrS PI promoter (ssrS PI WT) or a mutated 
derivative (ssrS PI mut) had been cloned. The ssrS PI mut carries a 
double substitution (CG to TA) at positions —3/— 2 (shown in the 
figure). In vitro transcription was performed in the presence of either 
Eo s or Ect 70 ; transcript amounts were determined by quantitative 
real-time PCR as described in 'Materials and Methods' section, using 
the RNA-I transcript as reference as previously described (22). 
Experiments were performed three times in duplicate, and standard 
errors are shown. 
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by microarray analysis (ROMA experiments). The in vitro 
transcription experiments have been performed on linear 
DNA, in the absence of any additional factor or molecule 
able to affect transcription initiation by either Ect s or Ect 70 
(i.e. DNA supercoiling, transcription regulators, histone- 
like proteins, ppGpp), and thus they represent a direct 
measurement of sequence-specific interactions between 
promoters and RNA polymerase. Our results indicate 
that selective promoter utilization by Ea 70 and Ea s is 
mediated by specific sequence features, and confirm the 
role of CT S -specific promoter elements previously identified. 
Promoters more efficiently recognized by Ea 7 in ROMA 
experiments are characterized by a high occurrence of 
the UP-like element, i.e. an A/T-rich region located imme- 
diately upstream of the —35 sequence and acting as 
a binding site for the a subunit of RNA polymerase 
(41), suggesting that, at least in vitro, the UP element 
might favour promoter recognition by Ect 70 . It is worth 
mentioning that, in Bacillus subtilis, UP elements are 
highly conserved among a A -dependent promoters (45), 
thus suggesting that UP elements might favour promoter 
recognition by the housekeeping ct factor in different 
bacteria. In contrast, UP-like elements flanking the —35 
sequence are less conserved in promoters better recognized 
by Ea s , where, however, A/T-rich elements seem to be 
scattered in the region spanning 70— 90 nt upstream 
of the transcription start (Supplementary Figure S2). 
It might be speculated that, similar to the UP element 
for Ea 70 -dependent promoters, AT-rich sequences 
located in the —70 to —90 region might also be involved 
in interaction with RNA polymerase a subunit at (^-de- 
pendent promoters. It is conceivable that the a subunit 
might contact alternative upstream promoter elements 
when assembled in different forms of RNA polymerase 
holoenzyme; indeed, differential ability to interact with 
UP elements has already been described for Ea 70 and 
Ea s (46). 

A sequence element showing strong differences between 
the two promoter sets is the discriminator, i.e. the region 
spanning between —10 and +1. This region shows a high 
T/A content for promoters more efficiently recognized by 
Ea s ; in contrast, Ea 70 -dependent promoters identified in 
ROMA experiments are biased towards a CG-rich region 
in the — 3/+1 residues (CCCG, Figure 2). Mutations in the 
discriminator region of the Ecr 70 -dependent ssrS PI 
promoter increasing its A/T content result in partial loss 
of preferential recognition by Ea 70 (Figure 3), providing 
further confirmation for a role of the — 10/+ 1 region in 
specific recognition by either form of RNA polymerase. 
GC versus AT content in the discriminator region can 
directly affect promoter melting, and GC-rich discrimin- 
ators are a common feature among promoters subject to 
negative regulation by ppGpp (47,48), which, indeed, 
inhibits Ect °-dependent transcription while promoting 
transcription by alternative forms of RNA polymerase 
(49). 

The sequence features indicated by our promoter 
analysis are consistent with previous observations based 
on sequence analysis of rpoS'-dependent promoters in the 
bacterial cell (4); this also includes the fairly strong con- 
servation of a C nucleotide immediately upstream of the 



— 10 hexamer (the — 13 C element) among cr s -dependent 
promoters. Indeed, the — 13 C element occurs in almost 
half of the promoters transcribed more efficiently by 
Ect s in ROMA experiments (14 out of 31 promoters; 
Supplementary File SI), opposed to a much lower fre- 
quency in Ecj 70 -dependent promoters (4 out of 29, 
Supplementary File S2). This result is not particularly 
surprising: indeed, a large amount of data has clearly 
shown that the presence of a C residue immediately 
upstream of the — lOnt favours promoter recognition by 
Ect s (7,8,50). The — 13C element might play a similar role 
as the TG motif at E<r 70 -dependent promoters lacking a 
—35 region (44). It is noteworthy that an in vivo analysis 
suggests that the — 13C element occurs in more than 70% 
of putative r/wS-dependent promoters (4): this observa- 
tion would suggest that the — 13C element might be 
needed to improve Ecj s -promoter interaction in the bac- 
terial cell, possibly to overcome the negative effects of 
DNA-binding proteins, such as H-NS, that can 
modulate RNA polymerase-promoter interaction. 

The results of the ROMA approach have also expanded 
our knowledge of the cr s regulon: the ct s protein is con- 
sidered a central element of the so-called 'general stress 
response' (51). Intracellular ct s concentration and expres- 
sion of cj s -dependent genes respond to reduction in 
growth rate (52); thus, any cellular stress affecting 
growth rate is likely to induce ct s accumulation, which 
in turn plays a direct role in oxidative, acid and osmotic 
stress through activation of specific genes. In addition, ct 
activates metabolic genes associated to stationary-phase 
metabolism, in particular carbon storage genes involved 
in glycogen (19,20) and trehalose metabolism (9,20); con- 
sistent with these observations, stress response and carbon 
metabolism genes are highly represented among genes 
preferentially transcribed by Ect s in ROMA experiments 
(Table 1). In addition, our results underline the import- 
ance of ct s for the expression of genes involved in poly- 
amine metabolism. Polyamines, in particular putrescine, 
play an important role in various cellular processes and 
can affect intracellular concentrations of the regulatory 
proteins ct n , Cra and H-NS, thus impacting global gene 
expression (53). As shown in Figure 4, putrescine, the 
most abundant polyamine in bacterial cells (54), is a 
product of arginine degradation. Putrescine accumulates 
at the transition between exponential and stationary phase 
(55), and it can be subsequently converted into other 
polyamines or degraded to succinate, which can be 
shunted into the tricarboxylic acid (TCA) cycle, in a 
NADPH-generating process (Figure 4). Interestingly, 
mutants unable to synthesize polyamines are more sensi- 
tive to oxidative stress, since they cannot induce ahpC, 
katE and katG genes, encoding three different peroxidase, 
suggesting that polyamine accumulation might control ex- 
pression of rpoS-dependent genes involved in the response 
to oxidative stress (54). In turn, our results indicate that 
genes encoding enzymes involved in both accumulation 
and degradation of putrescine, one of the main poly- 
amines found in the bacterial cell, are r/wS-dependent. 
Indeed, the speB gene, encoding the putrescine- 
biosynthetic enzyme agmatinase, is preferentially trans- 
cribed by Ect s in vitro (Table 1 and Figure 1) and is 



5352 Nucleic Acids Research, 2011, Vol. 39, No. 13 



L-arginine 



H 

C0 2 

H 2 0 
urea 



51 

agmatine 



arginine decarboxylase 
adiA 



agmatinase 
speB 



putrescine 



L-glutamate + ATP 



51 



qlutamine-putrescine lyase 
puuA 



2H* + ADP + Pi 

y-glutamyl-L-putrescine 



0 2 + H 2 0 



5| 



y-glutamylputresclne oxydase 
puuB 



NH„* + H z O 

y-glutamyl-y-aminobutyraldehyde 

NADP* + H 2 0 



51 



NADPH + H 

4-(glutamylamino)-butanoate 



I 1 

I y-glutamyl-y-aminobutyraldehyde | 

dehydrogenase 
I puuC | 



H 2 0 

L-glutamate 



v-qlutamvl-v-aminobutvrate 
hydrolase 
puuD 



2-oxoglutarate 



4- aminobutyrate 
i 1 




L-glutamate 

succinate semialdehyde 



4-aminobutyrate aminotransferase I 
I puuE I 

I I 

I 4-aminobutyrate transaminase | 

I gabT 



NADP* + H,0 



NADPH + H* 



succinate semialdehyde dehydrogenase 
NADP-dependent 
gabD 



succinate 

Figure 4. Schematic representation of a biochemical pathway under the 
control of the rpoS gene: arginine degradation to succinate via 
putrescine (adapted from EcoCyc; http://ecocyc.org/). Genes belonging 
to this pathway and found to be preferentially transcribed by Ea in 
ROMA experiments are boxed with solid lines (speB, puuB and gabD); 
genes co-transcribed with either puuB or gabD (i.e. belonging to the 
puuCBE and gabDTP operons; see also Supplementary Table 1) are 
boxed with dashed lines. Other genes of the pathway known to be 
dependent on the rpoS gene in vivo (puuA and puuD) (20,55) are 
underlined. 



/-/wS-depeiideiit in vivo, in particular in LB 1/4 medium 
(Table 3). Likewise, the genes involved in putrescine deg- 
radation, belonging to the puuCBE and gabDTP operons, 
are efficiently transcribed by Ea s in ROMA experiments 
(Table 1 and Supplementary Table SI) and show depend- 
ence on a functional rpoS gene in vivo (Table 3). 



The gabDTP operon, as well as the puuA and puuD 
genes, also involved in putrescine degradation, have 
already been reported to be r/wS'-dependent in vivo 
(20,55,56). Thus, a s could control every step in putrescine 
metabolism by regulating genes responsible for both its 
biosynthesis and its degradation, as shown in Figure 4. 
Activation of putrescine biosynthesis rather than degrad- 
ation might respond to different growth conditions and 
environmental cues, as also suggested by the very different 
levels of r/wS-dependent regulation of putrescine-related 
genes (i.e. speB, gabD and puuB) in different growth media 
(Table 3). 

r/wS-dependent control of intracellular polyamines con- 
centrations represents an important mechanism of indirect 
gene regulation by the a s protein, since polyamines act as 
signal molecules able to impact global transcription 
pattern in the bacterial cell (53). Indirect control of gene 
expression by a s can also occur through activation of 
regulatory proteins and of regulatory RNAs. Indeed, 
ROMA experiments would suggest that Ea s directly 
controls at least four genes encoding regulatory proteins 
(appY, ilvY, rhaS and yciT) and four genetic loci encoding 
regulatory RNAs (cyaR, micA, sgrSjsgrT and ECs3934) 
(Table 1). Interestingly, the sgrSjsgrT locus, encoding 
both a non-coding RNA (SgrS) and a small regulatory 
protein (SgrT), negatively affects expression of the ptsG 
glucose uptake system (57), which is overexpressed in rpoS 
mutant strains of E. coli (20,58). However, the extent of 
cr s -dependent indirect regulation of gene expression 
through non-coding RNA might not be limited to these 
four loci. Indeed, a significant number of intergenic 
regions, often located immediately upstream of known 
ORFs in antisense direction, was efficiently transcribed 
by Ea s (Table 1). It is possible that at least a part of 
these intergenic regions might indeed be transcribed in 
the bacterial cell and function as cw-acting regulatory 
RNAs. Similarly, several ORFs only annotated in patho- 
genic E. coli strains were detectable as transcripts in the 
ROMA experiment, although MG1655 genomic DNA 
had been used in the experiments. Sequence comparison 
allowed us to determine that these ORFs are indeed 
present in MG1655, but they are not accounted for in 
the available databases. As observed for intergenic 
regions efficiently transcribed by Ect s , several of these 
ORFs also overlap known genes in an antisense direction, 
suggesting that at least some of them might be involved 
in modulating gene expression. One ORF, c3113, par- 
tially overlaps the promoter region of the rrsG ribosomal 
operon, and quantitative real-time PCR experiments 
suggest that it is transcribed in a stationary-phase- 
dependent and r/wS-dependent manner in bacterial cells 
grown in glucose-based medium (Table 3). Similar to 
c3113, an intergenic region showing preferential transcrip- 
tion by Ea s (IG223409_223770-r) also partially overlaps 
the ribosomal operon rrsH, but in an antisense direction. 
Thus, our observations would suggest that small proteins 
or non-coding RNAs might play an important role in 
rpoS'-dependent negative regulation of genes associated 
with fast growth, such as ribosomal operons and the 
ptsG glucose uptake system. 
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